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Abstract 

In this work, we introduce decentralized network interdiction games , which model the interactions 
among multiple interdictors with differing objectives operating on a common network. As a starting 
point, we focus on decentralized shortest path interdiction (DSPI) games, where multiple interdictors 
try to increase the shortest path lengths of their own adversaries, who all attempt to traverse a com¬ 
mon network. We first establish results regarding the existence of equilibria for DSPI games under 
both discrete and continuous interdiction strategies. To compute such an equilibrium, we present a re¬ 
formulation of the DSPI games, which leads to a generalized Nash equilibrium problem (GNEP) with 
non-shared constraints. While such a problem is computationally challenging in general, we show that 
under continuous interdiction actions, a DSPI game can be formulated as a linear complementarity prob¬ 
lem and solved by Lemke’s algorithm. In addition, we present decentralized heuristic algorithms based 
on best response dynamics for games under both continuous and discrete interdiction strategies. Finally, 
we establish theoretical bounds on the worst-case efficiency loss of equilibria in DSPI games, with such 
loss caused by the lack of coordination among noncooperative interdictors, and use the decentralized 
algorithms to empirically study the average-case efficiency loss. 


1 Introduction 


In an interdiction problem, an agent attempts to limit the actions of an adversary operating on a network 
by intentionally disrupting certain components of the network. Such problems are usually modeled in the 
framework of leader-follower games and can be formulated as bilevel optimization problems. Interdiction 
models have been used in various military and homeland security applications, such as dismantling drug 
traffic networks 1451, preventing nuclear smuggling [30j and planning tactical air - strikes 1221. Interdiction 
models have also found applications in other areas such as controlling the spread of pandemics [3| and 
defending attacks on computer communication networks [39]. 

Traditionally, interdiction problems have been analyzed from a centralized perspective; namely, a single 
agent is assumed to analyze, compute and implement interdiction strategies. In many situations, however, it 
might be desirable and even necessary to consider an interdiction problem from a decentralized perspective. 
For instance, a supervising body, in control of multiple agents in a common system, may assign each agent to 
an adversary of interest. Each agent is then responsible for computing and implementing its own interdiction 
strategy against the designated adversary. Other situations may involve multiple independent agents, such 
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as security agencies of different countries, trying to achieve a common goal on a shared network. Without 
any coordination between the agents, one might expect that a decentralized interdiction strategy may be 
inefficient compared to one determined by a central decision maker. This paper is focused on modeling and 
analyzing such settings and the inefficiencies that may arise. 

In this paper, we introduce decentralized network interdiction (DNI) games, in which multiple agents 
with differing objectives are interested in interdicting parts of a common network. We focus on a specific 
class of these games, which we call decentralized shortest path interdiction (DSPI) games. We investi¬ 
gate various properties of equilibria in DSPI games, including their existence and uniqueness, and propose 
algorithms to compute equilibria of these games. Using these algorithms, we also conduct empirical stud¬ 
ies on the efficiency loss of equilibria in the DSPI game compared to optimal solutions obtained through 
centralized decision making. 

Decentralized network interdiction games, as will be formally defined in Section 2, appear to be new. 
To the best of our knowledge, there has been no previous research on such games. As a result, not much 
is known about the inefficiency of equilibria for these games or intervention strategies to reduce such in¬ 
efficiencies. There has been a considerable amount of work, however, on interdiction problems from a 
centralized decision-maker’s perspective. As mentioned earlier, interdiction problems have been studied in 
the context of various military and security applications. For extensive reviews of the existing academic 
literature on interdiction problems, we refer the readers to Church et al. |8j and Smith and Lim |[39|. 

There have also been many studies on the inefficiency of equilibria in other game-theoretic settings. 
Most of the efforts have been focused on routing games |(33j|44|, in which selfish agents route traffic through 


a congested network, and congestion games [361, a generalization of routing games. Some examples include 
Emm 37][40|. Several researchers have also studied the inefficiency of equilibria in network formation 
games, in which agents form a network subject to potentially conflicting connectivity goals [ll}[2}[T5l. The 


inefficiency of equilibria has been studied in other games as well, such as facility location games [43 [, 
scheduling games E8|, and resource allocation games (25] [26]. Almost all of the work described above 


study the worst-case inefficiency of a given equilibrium concept. Although a few researchers have studied 
the average inefficiency of equilibria, either theoretically or empirically, and have used it as a basis to design 
interventions to reduce the inefficiency of equilibria (T0|4l |, research in this direction has not received much 
attention. 

One potential reason for the lack of attention paid to decentralized network interdiction games may 
be that such games often involve nondifferentiability, as each interdictor’s optimization problem usually 
entails a max-min type of objective functions. Games involving nondifferentiable functions are generally 
challenging, in terms of both theoretical analysis of their equilibria and computing an equilibrium. While 
in some cases (such as in the case of shortest path interdiction), a smooth formulation (through total uni¬ 
modularity and duality) is possible, such a reformulation will lead the resulting network game to the class 
of generalized Nash equilibrium problems (GNEPs), in which both the agents’ objective functions as well 
as their feasible action spaces depend on other agents’ actions. Although the conceptual framework of 
GNEPs can be dated to Debreu 1131, rigorous theoretical and algorithmic treatments of GNEPs only be¬ 
gan in recent years (T7J- Several techniques have been proposed to solve GNEPs, including penalty-based 
approaches 118 2TJ, variational-inequality-based approaches [32], Newton’s method (T4| , projection meth¬ 
ods (46j , and relaxation approaches (29f42| . Most of the work on GNEPs has focused on games with shared 
constraints due to their tractability 116 231. In such games, a set of identical constraints appear in each 
agent’s feasible action set. However, as will be seen later, in a typical decentralized network interdiction 
game, the constraints involving multiple agents’ actions that appear in each agent’s action space are not 
identical. As a result, such games give rise to more challenging instances of GNEPs. 

Based on the discussions above, the major contributions of this work are as follows. First, we estab¬ 
lish the existence of equilibria for DSPI games with continuous interdiction. In DSPI games with discrete 
interdiction, the existence of a pure strategy Nash equilibrium (PNE) is more subtle. We first demonstrate 
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that a PNE does not necessarily exist in general discrete DSPI games. However, when all agents have the 
same source-target pairs (i.e., multiple agents try to achieve a common goal independently), a PNE exists 
in discrete DSPI games. Second, for DSPI games under continuous interdiction, we show that each agent’s 
optimization problem can be reformulated as a linear programming problem. As a result, the equilibrium 
conditions of the game can be reformulated as a linear complementarity problem with some favorable prop¬ 
erties, allowing it to be solved by the well-known Lemke algorithm. For discrete DSPI games (and for 
continuous games as well), we present decentralized algorithms for finding an equilibrium, based on the 
well-known best-response dynamics (or Gauss-Seidel iterative) approach. While such an approach is only 
a heuristic method in general, convergence can be established for the special case when the agents have 
common source-target pairs. For more general cases, we obtain encouraging empirical results for the per¬ 
formance of the method on several classes of network structures. Third, in measuring the efficiency loss 
of DSPI games due to the lack of coordination among noncooperative interdictors, as compared to a cen¬ 
tralized interdiction strategy (that is, a strategy implemented by a single interdictor with respect to all the 
adversaries), we establish a theoretical lower bound for the worst-case price of anarchy of DSPI games 
under continuous interdiction. Such an efficiency loss measure, however, may be too conservative, and 
we therefore use the decentralized algorithms to empirically quantify the average-case efficiency loss over 
some instances of DSPI games. These results can help central authorities design mechanisms to reduce such 
efficiency losses for practical instances. 

The remainder of this paper is organized as follows. We begin in Section 2 with definitions and for¬ 
mulations of DNI games and DSPI games. In Section 3, we present the main theoretical results of the 
paper, including an analysis of the existence and uniqueness of equilibria in DSPI games. In Section 4 we 
investigate algorithms for solving DSPI games. We describe a centralized algorithm based on a linear com¬ 
plementarity formulation, as well as decentralized algorithms for computing equilibria of DSPI games. We 
also give results of our computational experiments with these algorithms for computing equilibria as well 
as quantifying the price of anarchy for various instances. Finally, in Section 5, we provide some concluding 
remarks. 

2 Decentralized Network Interdiction Games 

2.1 Formulation 

Network interdiction problems involve interactions between two types of parties - adversaries and interdic¬ 
tors - with conflicting interests. An adversary operates on a network and attempts to optimize some objec¬ 
tive, such as the flow between two nodes. An interdictor tries to limit an adversary’s objective by changing 
elements of the network, such as the arc capacities. Such interactions have historically been viewed from a 
leader-follower-game perspective. The interdictor acts as the leader and chooses an action while anticipating 
the adversary’s potential responses, while the adversary acts as the follower and makes a move after observ¬ 
ing the interdictor’s actions. From the interdictor’s perspective, this captures the pessimistic viewpoint of 
guarding against the worst possible result given its actions. 

In this work, we consider strategic interactions among multiple interdictors who operate on a common 
network. The interdictors may each have their own adversary or have a common adversary. If there arc 
multiple adversaries, we assume there is no strategic interaction among them. We also assume that the 
interdictors arc allies in the sense that they arc not interested in deliberately impeding each other. 

Formally, we have a set T = {1..... I 7 } of interdictors or agents, who operate on a network G = (V) ,4), 
where V is the set of nodes and A is the set of arcs. Each agent’s actions or decisions correspond to 
interdicting each arc of the network with varying intensity: the decision variables of agent / £ J are 
denoted by € X? C where X- f is an abstract set that constrains agent /’s decisions. For any 

agent / £ T, let x f denote the collection of all the other agents’ decision variables; that is, x * = 
^X , . . • , X^~ , X , . . . , X F ). The network obtained after every agent executes its decisions or interdiction 
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strategies is called the aftermath network. The strategic interaction between the agents occurs due to the 
fact that the properties of each arc in the aftermath network are affected by the combined decisions of all the 
agents. 

In addition to the abstract constraint set X we assume that each agent / G T faces a total interdiction 
budget of m. The cost of interdicting an arc is linear in the intensity of interdiction; in particular, agent /’s 
cost of interdicting arc (u, v) by x uv units is ci v x, w . Without loss of generality, we assume that b? > 0 and 
cl. v > 0 for each arc (u. v) E A and for each agent / G X. 

The optimization problem for each agent / G T is: 

maximize 9^(x^,x f) 

X f 

subject to Y c L x L< hf , (1) 

(u,v)£A 

x f E X f , 


where the objective function 6f is agent /’ s obstruction function, or measure of how much agent /’ s adver¬ 
sary has been obstructed. Henceforth, we refer to the game in which each agent / G T solves the above 
optimization problem (jT|) as a decentralized network interdiction (DNI) game. The obstruction function 0-1 
can capture various types of interdiction problems. Typically 6* is the (implicit) optimal value function of 
the adversary’s network optimization problem parametrized by the agents’ decisions, which usually mini¬ 
mizes flow cost or path length subject to flow conservation, arc capacity and side constraints. 

Suppose that a central planner, with a comprehensive view of the network and the agents’ objectives, 
could pool the agents’ interdiction resources and determine an interdiction strategy that maximizes some 
global measure of how much the agents’ adversaries have been obstructed. Let 0 c (x l ,..., x F ) represent 
the global obstruction function for a given interdiction strategy (x 1 ,..., x F ). The central planner’s problem 
corresponding to the DNI game (jT]) is then: 

maximize 6 c (x 1 ,..., x F ) 

x 1 ,...,x F 

subject to E E ^UV^UV sE b f , (2) 

/E*F (u,v)^A /E.? 7 

xf G x f V/ E T. 

We refer to ([2]) as the centralized problem, and focus primarily on when the global obstruction function is 
utilitarian ; that is, 

6 c {x l ,..., x F ) := Yj &, x~f). 

/e^ 

As mentioned earlier, one of the goals of this work is to quantify the inefficiency of an equilibrium of a DNI 
game - a decentralized solution to problem (|T| - relative to a centrally planned optimal solution - an optimal 
solution to problem ([2]). A commonly used measure of such inefficiency is the price of anarchy. Formally 
speaking, let Mi be the set of all equilibria corresponding to a specific instance I. (In the context of DNI 
games, an instance consists of the network, obstruction functions, interdiction budgets, and costs). For the 
same instance I, let (x l ,..., x F ~) denote a global optimal solution to the centralized problem ([2]). Then 
the price of anarchy of the instance I is defined as 


p(I) := 


0 c (x L ",.. .x 
max - . - 

(,x 1 N ,-,x F )eAt I S c (x ]s 


''N ’ 


, x n) 


(3) 


Let X be the set of all instances of a game. We assume implicitly that for all I E X, the set Mi is nonempty 
and a global optimal solution to the centralized problem exists. By convention, p is set to 1 if the worst 
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equilibrium as well as the global optimal solution to the centralized problem both have zero objective value. 
If the worst equilibrium has a zero objective value while the global optimal value of the centralized problem 
is nonzero, p is set to be infinity. In addition to the price of anarchy for an instance of a game, we also define 
the worst-case price of anarchy over all instances of the game (denoted as w.p.o.a ) as follows: 

w.p.o.a : = sup p{I). (4) 

/ex 

Since we wish to study the properties of a class of games such as DNI games, rather than a particular 
instance of a game, we arc more interested in the worst-case price of anarchy. However, there are two major 
difficulties associated with such an efficiency measure. First, it is well-known that the worst-case price 
of anarchy may be a very conservative measure of efficiency loss, since the worst case may only happen 
with pathological instances. Second, explicit theoretical bounds on the worst-case price of anarchy may be 
difficult to obtain for general classes of games. In fact most of the related research has focused on identifying 
classes of games where such bounds may be derived. In this work, we show how our proposed decentralized 
algorithms can be used to empirically study the average-case efficiency loss (denoted by a.e.l). Let X' denote 
a finite set such that X' C X, and let \X'\ denote the cardinality of the the set X'. Then 

a.e./(X / ) := P( 7 )- ( 5 ) 

' ' iei 1 

In other words, the average-case efficiency loss is the average value of p(I) as defined in ([3]) over a set of 
sampled instances X' C X of a game. 

As mentioned above, the generic form of problem (|T| can be used to describe various network inter¬ 
diction settings, such as maximum flow interdiction. To start with models that are both theoretically and 
computationally tractable, we focus on decentralized shortest-path interdiction games, which we describe in 
detail next. 

2.1.1 Decentralized Shortest Path Interdiction Games 

As the name suggests, decentralized shortest path interdiction (DSPI) games involve agents or interdictors 
whose adversaries are interested in the shortest path between source-target node pairs on a network. Inter¬ 
dictors act in advance to increase the length of the shortest path of their respective adversaries by interdicting 
(in particular, lengthening) arcs on the network. 

To describe these games form ally, we build upon the setup of the general decentralized network inter¬ 
diction game described in Section |2X) Each agent / E T has a target node t? £ V that it wishes to protect 
from an adversary at source node s? <E V by maximizing the length of the shortest path between the two 
nodes. The agents achieve this goal by committing some resources (e.g. monetary spending) to increase the 
individual arc lengths on the network: the decision variable x{ iv represents the contribution of agent / E T 
towards lengthening arc (u, v ) E A. The arc length d uv {x *, x •') of arc ( u. v) E A in the aftermath network 
depends on the decisions of all the agents. 

We consider two types of interdiction. The first type of interdiction is continuous : in particular, 

X? := {x-F £ : x£ v > 0 \/(u,v) £ A}. 

The arc lengths after an interdiction strategy (x 1 ...., x F ) has been executed are 

d uv (x 1 ,...,x F ) = d° uv + ^2x{ lv V(u,r)eA (6) 

/6-X 

where xtv captures how much agent / extends the length of arc (u,v). We assume that d\] v > 0 for all 

(u, v ) £ A. 
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The second type of interdiction is discrete : in this case, 

Xf ■— { x f ^ rI a I : x f uv ^ {o, 1} V(tt, v) E A} 

and the arc lengths in the aftermath network are 

duvix 1 ,... ,x F ) = d° uv + e uv maxxl v V(u,v) e A, (7) 

f''- 

where e uv E M>o is the fixed extension of arc (u, v ). In other words, the length of an arc is extended by a 
fixed amount if at least one agent decides to interdict it. 

Let P-f = {p{,P 2 ,,ptf } be the set of s? — t? paths available to agent / E T. The length of a path 
p E Pf is given by 

d p (x 1 ,...,x F ) = ^2 d uv (x 1 ,...,x F ), (8) 

where d uv {x x ,..., x F ) is as defined in equation ([6]) for continuous interdiction, and as defined in ([7]) for the 
discrete case. 

The optimization problem for each interdicting agent / E T is then: 

maximize (F (a/, x~*) = min d p (x\ x~f) 

xf PffPf 

subject to E ^UV^UV < bf i (9) 

(u,v )&A 
x S E X f . 

Under continuous interdiction and the general assumption made ealier that Xf is nonempty, convex and 
compact, the feasible strategy set for agent /, given by {xf E X? \ J2( u v )eA c uv x t,v < id ] is also convex 
and compact. To rule out uninteresting cases, we also assume that the feasible set for each agent is also 
nonempty (meaning that each agent has the budget to at least interdict one arc). Moreover, given an x~d, 
the objective function in ([9]) is the minimum of a set of affine functions of x f , and therefore continuous in 
xJ . Thus, by Weirstrass’s extreme value theorem, each agent has an optimal strategy given the strategies of 
the other agents. Note, however, that the objective function in Q is not differentiable with respect to xj in 
general. 

For DSPI games with discrete interdiction, the feasible strategy set for each agent is finite. Therefore 
an optimal solution to each agent’s problem always exists with a given x~P In the following section, 
we analyze the existence and uniqueness of pure strategy Nash Equilibria for DSPI games, under both 
continuous and discrete settings. 

3 Game Structure and Analysis 

3.1 Existence of Equilibria 

We first consider the existence of a Nash equilibrium of a DSPI game when interdiction decisions are 
continuous. The key is to show that the objective function in ([9]), 0^'{xJ\ x~d), is concave in x/, despite the 
fact that it is not differentiable. 

Proposition 1. Given that each agent f E T solves the problem <0. with d p (xf, x f ) defined as in 0 and 
0, and assume that the abstract set Xf in 0 is nonempty, convex and compact for each f E T, the DSPI 
game under continuous interdiction has a pure strategy Nash equilibrium. 
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Proof. Based on the assumption, the feasible region in Q is nonempty, convex and compact. With a fixed 
x~f , the objective function of agent / is the minimum of a finite set of affine functions in x\ and therefore, 
is concave with respect to x* , by the well-known fact in convex analysis (Cf. [jhj). Consequently, the DSPI 
game belongs to the class of “concave games,” introduced in Rosen 1351, and it is shown in 1351 that a 
pure-strategy Nash equilibrium always exists for a concave game. □ 


Under discrete interdiction, the existence of a PNE is not always guaranteed when different interdictors 
are competing against different adversaries. We illustrate the nonexistence of PNE in Example [T] below. 

Example 1. Consider the network given in Figure |T] 



Figure 1: Network topology for DSPI game in Example [T] 

In this game, there arc two agents - agent 1 and agent 2 - who arc attempting to maximize the lengths 
of the s 1 -t 1 paths and s 2 -t 2 paths respectively. Note that t 1 = t 2 . The data for the problem, including initial 
arc lengths, cost of interdiction and arc extensions are given below in Table [T] 


Arc tag 

Initial length 

Arc extension 

Cost to player 1 

Cost to player 2 

a 

7 

0.5 

3 

20 

b 

0 

2 

6 

20 

c 

0 

1.5 

5 

20 

d 

0 

6 

15 

15 

e 

0 

1 

20 

20 

f 

1 

6 

15 

15 


Table 1: Network data for Example 1 


Suppose b 1 = 8 and b 2 = 15. As a result, player 1 can either interdict the arcs a, b and c one at a time, 
or the arcs a and c simultaneously. Similarly, player 2 can either interdict arc d or arc /. 

Thus, player 1 has four feasible pure strategies and player 2 has two feasible pure strategies. The strategy 
tuples along with the corresponding pay-offs for each player are summarized in Table [2] It is easy to verify 
that for any joint strategy profile, there is a player who would prefer to deviate unilaterally. Therefore, this 
instance of the DSPI game does not possess a NE. 

In the previous example, the agents have a common target node, but different source nodes. However, in 
the class of games in which the interdictors have a common adversary, i.e., when each agent maximizes the 
shortest path between a common source-target pair, we can show that DSPI games under discrete interdiction 
possess a PNE. 

Consider the DSPI game where each agent is trying to maximize the shortest path lengths between 
nodes s and t. Since the objective function of each agent is the same, we can write the following centralized 
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P\ IP -2 strategies 

d 

/ 

a 

6,1 

0,0 

c 

7,1 

1.5,1.6 

(a,c) 

7.5,1 

1.5,1.5 

b 

7,1 

2,0 


Table 2: Pay-off combinations for Example [T] 


optimization problem to maximize the shortest s — t path distance subject to the individual agents’ budget 
constraints. Let P st be the set of s — t paths in the network. The centralized optimization problem is: 

maximize min d p (x 1 ,x 2 ,..., x F ) 

X p£P st 

subject to E ^uv^uv <b f V/ € P, (10) 

(il,v)£A 

x f uv £{Q,l} V(u,n) <E A,f e T. 

The feasible solution space of the above problem is finite under individual agents’ budget constraints. There¬ 
fore, the centralized problem always has a maximum. Furthermore, the optimal solution to this problem is a 
PNE of the DSPI game as we show in the following result. 

Proposition 2. Suppose the source and target for each agent in a DSPI problem are the same. Let x* denote 
the optimal solution of the centralized problem ©• Then x* is a PNE to the DSPI game under discrete 
interdiction. 

Proof. Assume the contrary, and suppose that there is an agent h for whom there exists a unilateral deviation 
x? that strictly increases the path distance s — t. By assumption, x h is feasible to the budgetary constraints 
for agent h. Therefore, x = (x h , x*~ h ) is feasible to ( fT0| ) with a strictly larger objective value. Clearly this 
is a contradiction to the optimality of x* to (|T0j). □ 


3.1.1 Uniqueness of equilibria 


Establishing sufficient conditions for a DSPI game to have a unique equilibrium is quite difficult. However, 
it is easy to find simple instances of DSPI games for which multiple equilibria exist. We give two such 
examples below. 


Example 2. Consider the following instance, based on the network in Figure [2] There are 2 agents: agent 1 
has an adversary with source node 1 and target node 5; agent 2 has an adversary with source node 1 and 
target node 6. The initial arc lengths are 0, interdiction is continuous, and the interdiction costs are the same 
for both agents and are given in the arc labels in Figure [2] Both agents have a budget of 1. Consider the case 
when e = 2. In this case, it is straightforward to see that the source-target path lengths for each agent must 
be equal at an equilibrium: if the path lengths arc unequal, an agent could improve its objective function by 
equalizing the path lengths. Therefore, in this example, any combination of decision variables that results in 
a shortest path length of 2/3 for each agent will be a generalized Nash equilibrium, and there is a continuum 
of such decision variable combinations. Indeed some of such equilibria are given in Table [3]in Section 4.5 


Example 3. Under discrete interdiction on the same underlying network, an interesting situation occurs 
when e = 0, the budget is 1, and the arc extensions are ah set to 1. In this case, an equilibrium occurs 
when the arcs (1,4) and (1, 2) are interdicted by one agenteach. What is interesting however is that there 
exist equilibria that have inferior objective values for both agents. Indeed, the extreme case of neither agent 
interdicting any arc can easily seen to be an equilibrium. This point in fact is a social utility minimizer over 
the set of feasible action combinations for the two agents. 











Figure 2: Network topology for DSPI game in Example [2] 

4 Computing a Nash Equilibrium 

In this section we discuss algorithms to compute equilibria of DSPI games. While the general formula¬ 
tion with each agent solving Q is sufficient for showing existence of equilibria, such a formulation is not 
amenable for computing an equilibrium mainly due to the ‘min’ function in the objective function. In this 
section, using a well-known reformulation of shortest path problems (through total unimodularity and linear 
programming duality), we obtain a generalized Nash equilibrium problem of the DSPI game. For contin¬ 
uous DSPI games, we further show that such a GNEP can be written as a linear complementarity problem 
(LCP) through the Karush-Kuhn-Tucker (KKT) optimality conditions. We then show that the resulting LCP 
has favorable properties, allowing the use of Lemke’s pivoting algorithm with guaranteed convergence to a 
solution (as opposed to a secondary ray). 

We refer to the LCP approach as a centralized approach, in the sense that the game is purely viewed 
as a system of equilibrium conditions, and a general algorithm capable of solving the resulting system is 
applied. We also present decentralized algorithms based on best-response dynamics, which are applicable to 
both continuous and discrete DSPI games. While not necessarily computationally more efficient, decentral¬ 
ized algorithms indeed have several advantages over centralized algorithms. First, a centralized algorithm 
usually cannot solve discrete games, which can nevertheless be solved by a decentralized approach, aided 
by integer programming solvers. Second, a decentralized algorithm may provide insight on how a partic¬ 
ular equilibrium is achieved among agents’ strategic interactions. Such insight is particularly useful when 
multiple equilibria exist, as is the case for many GNEPs. It is well-known (for example, |3TJ) that a game 
may possess unintuitive Nash equilibria that would never realistically be the outcome of the game. Third, 
a decentralized algorithm can naturally lead to multithreaded implementations that can take advantage of 
a high performance computing environment. In addition, different threads in a multithreaded implementa¬ 
tion may be able to find different equilibria of a game, making such an algorithm particularly suitable for 
computationally quantifying the average efficiency loss of noncooperative strategics. 

In the following discussion, we first present the GNEP formulation of DSPI games under continuous 
interdiction. We then reformulate the GNEP as an LCP and analyze the properties of the LCP formulation. 
Finally we present the decentralized algorithms for both discrete and continuous DSPI games formulated as 
GNEPs. 
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4.1 Dual GNEP formulation 


In ([9]), each agent’s objective function Qf {xK x *) involves its adversary’s shortest path problem, which 
can be written as an integer program as follows: 


0f(xf,x f) 


( min 

z f 


s.t. 


V 


X z Ld uv (x f ,x f ) 
( u,v)eA 


VV _Vz / 

/ J ^UV / J ^vu 

v£V v£V 


'1 
< 0 
-1 


zL e {0,1} V(u,v)eA 


if u = sf 
if u / sf ,tf 
if u = ff 


\ 


/ 


( 11 ) 


where the binary variables zl v represents whether an arc (u. v) E A is in the shortest sf -tJ path. Al¬ 
though the inner minimization problem is an integer program with binary variables, it is well known that 


the constraint matrix is totally unimodular (e.g. [381), rendering the integer program equivalent to its linear 


programming relaxation. Therefore, once the interdictors’ variables (x 1 ,..., x F ) are fixed, we can use lin¬ 
ear programming duality to transform the inner minimization problem to a maximization problem [|24j] and 
reformulate agent /’s optimization problem Q as: 


maximize 

xf , yf 


f 

y t f 


f 

y s f 


subject to yf -yf< d uv (x f , x f ) V(u, v) E A, 


ci,„xu < b f , 


J r f 

-'UV'^UV 


( 12 ) 


E 

(u,v)eA 

x f E X f , 

yf > 0 \/v E V. 

It is also well-known (see, for example, [5,27]) that at optimality, the term yl — yf f is equal to the length 
of the shortest sf-u path in the aftermath network. This is the reason why we are able to restrict the yf 
variables to be non-negative. In addition, it also allows us to restrict the yf variables to be integral if the 
underlying network data is integral, since at optimality all path lengths would also be integral. Moreover, as 
we show below, it also allows us to bound the yf variables. 

When interdiction is continuous, the largest possible length in the aftermath network for any arc is 
bounded by the largest interdiction possible on that arc. Keeping the budgetary constraints in mind, the 
maximum length of any arc (u, v) £ A in the aftermath network is bounded by 

[ bf 

+ F ■ max < — r 

(u,v)eA { d v . 


d° 


Therefore, the length of any path in the aftermath network is bounded above by 

bf 


M = X 4L + \A\ F ■ max , , 

/SJ 7 , a£A 1 (J 


(u,v)eA 


On the other hand, when interdiction is discrete, the length of any path in the aftermath network is bounded 
above by M = J2( u ,v)eA( d uv + e uv)- 

Since only the differences yl — yl across arcs (u, v ) are relevant to the formulation ( fT2| ), we may always 
f f f 

replace y u by y u — y s f° r eac h « £ kto obtain a feasible solution with equal objective value. Therefore 


10 








we can then add the constraints 0 < yl < M for all u £ V to the problem ( p~2| ) to obtain an equivalent 
formulation of a DSPI game, where each agent / £ T solves the following problem: 


maximize 

xf , yf 


f 

y t f 


f 

y S f 


subject to yl -y f u < d uv (x f , x f ) V(u, v ) £ A, 


Y C uv x L < bf i (13) 

(il,v)eA 

0 < yl < M \/u £ V, 
x f £ X f . 


When analyzing the DSPI game from a centralized decision-making perspective, we assume that the 
global obstruction function is utilitarian, i.e., the sum of the shortest s*-t* path lengths over all the agents / £ 
T. We also assume that the resources are pooled among all the agents, resulting in a common budgetary 
constraint. Thus the centralized problem for DSPI games can be given as follows: 

maximize Y (y( f ~ y{ s ) 

’ * f£F 

subject to yl -yl< d uv (x f , x~ f ) V(it, v) € A, f 6 J, 

Y Y c uv x L <Y bf (14) 

(u,v)€A 

0 <yl<M VueVJer, 
x f eX f V/ € T. 

Since yJ is bounded for all / £ J 7 , the feasible set for (Td] ) is a compact set. Thus a globally optimal solution 
exists regardless of whether xf is continuous or discrete for all / £ T. In the continuous case, Weierstrass’s 
extreme value theorem applies since all the functions are continuous and the xf variables are bounded due 
to the non-negativity and budgetary constraints. In the discrete case, there are only a finite number of values 
that the xf variables can take. 

The formulation ( fl3j ) gives us some insight into the structure of strategic interactions among agents in 
a DSPI game. Note that in formulation < [T3|) , the objective function for each agent / e T only depends 
on variables indexed by / (in particular, y } f and yf f ). However, the constraint set for each agent / is 
parametrized by other agents’ variables x~f , which leads to a generalized Nash equilibrium problem. 

Formally speaking, consider a simultaneous-move game with complete information[j] As before, let 
T = {1,..., F} denote the set of agents. Let the scalar-valued function Of (\f, ) t> e th e utility function 

of agent / £ 7, which is a function of all the agents actions (x> x 0- Tf> c feasible action space of agent 
f £ X is a set-valued mapping EP (x 0 with dimension nj (in a regular Nash equilibrium problem, each 
agent’s feasible action space is a fixed set). Let n := nj. Then r.f (•) is a mapping from to 

R"-f. Parametrized by the other agents’ decisions x~> eac 'li agent f £ T in a GNEP solves the following 
problem: 

,X 

(15) 

subject to x f € Z f (x f )- 

'A game is said to be simultaneous-move if the agents must make their decisions without being aware of the other agents’ 
decisions. A game has complete information if the number of agents, their payoffs and their feasible action spaces are common 
knowledge to all the agents. 


maximize e f (x f 

x f 
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It is straightforward to see how the DSPI game in ( [T3| ) translates into a GNEP problem: for all / E J-, 

X f = (x f ,y f ), 

0 f (x f ,x~ f ) = y [ s - y f s f, 

yl - y{ < d uv (x f ,x~ f ) 
v c f x f < b f 

(u,v)gA 

0 < y[ < M \/u € V, 
x f E X s 

Note that \ = (x 1 , • • ■ ,X F ) G K n , where n = F (|V| + |A|). 

To formally define a Nash equilibrium to a GNEP, we let <)(y) denote the Cartesian product of the 
feasible sets of each agent corresponding to decisions x = (x 1 ; • ■ •, X F )\ th at 

O(X) : = ^(X -1 ) x H 2 (x^ 2 ) x • • • x E f (x~ F )- (17) 

For a simultaenous-move GNEP with each agent solving problem ( p3T ), a generalized Nash equilibrium is 
defined as follows: 

Definition 1. A vector xn = (Xjv’ • • • >Xn) ^ ( -' > -(xn) is a pure-strategy generalized Nash equilibrium 
(PGNE) if for each agent / G T, 

0 f {x f N , Xn 1 ) > 0 f (x f , Xj/), Vx f e S^(x^). (18) 

Based on the above definitions and discussions, it is easy to see that if ( x , y) is an equilibrium to a DSPI 
game formulated as a GNEP using the primal-dual formulation, then x is must be an equilibrium to the DSPI 
game using only the primal formulation. Such a relationship is formally stated below. 

Proposition 3. Suppose that \ = (x. y) £ M Fx (l" 4 l+l 1/ l) is a PGNE to the GNEP where each agent solves 
( |16[ ). Then x is a PNE to the DSPI game where each agent solves <|9]). □ 

For the remainder of the paper, we will mainly use the GNEP formulation, as our focus is shifting from 
establishing theoretical properties of DSPI equilibria to computing such an equilibrium. For continuous 
interdiction, the GNEP formulation can be further reformulated as an LCP, as we show below. On the 
other hand, under discrete interdiction, we show that certain classes of DSPI games admit provably conver¬ 
gent decentralized algorithms. In this case, we sequentially solve agents’ problems ( fT6] ) using an integer 
programming solver. 

4.2 Linear Complementarity Formulation 

Before presenting the LCP formulation for the DSPI game, we introduce some basic notation and definitions. 
Formally, given a vector q G R d and a matrix M € R dxd , a linear complementarity problem LCP(q, M) 


consists of finding a decision variable vector w € R d such that 

w > 0, 

(19) 

q + Mw > 0, 

(20) 

w T (q + Mw) = 0. 

(21) 


The LCP(7/. M) is said to be feasible if there exists a w G W 1 that satisfies < [T9| ) and ( |20| ). Any w satisfying 
m i s called complementary. If w is both feasible and complementary, it is called a solution of the LCP, 


= / (X" / ) = 


X s = (x f ,y f ) 


V(it, v) £ A,' 


( 16 ) 


. 
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and the set of such solutions is denoted by SOL(g, M). The LCP is said to be solvable if it has a solution. 
A thorough exposition of the theory and algorithms for LCPs can be found in [ 121. 

Consider now the DSPI game with continuous interdiction, introduced in Section |2.1.1[ where agent /’s 
optimization problem is given in ( fT2[ ). When the interdiction decisions of the agents f ^ f arc fixed, ( fT2| ) is 
a linear program (LP). In this case, the KKT conditions are both necessary and sufficient for a given feasible 
solution to be optimal. 

In order to present the LCP reformulation in a more compact form, we introduce the following notation. 
Let \V\ = n and A = m. Denote by Q the arc-node incidence matrix of the graph G. Further let X denote 
an identity matrix, and 0 be a vector or a matrix of all zeros, of appropriate dimensions, respectively. The 
objective function coefficients for the LP ( |T2] ), denoted by 4>f G M m+n can be given as follows: 

if u = sf 

0 m 

J 


<j)f = 


where 


yf = 


1 

0 

-1 


if tt / s ?, t-f 
if u = tf 


The right hand sides for the constraints are denoted as the vector rf (x f) G M m +1 : 

~-<P 
-bf 


rf{x *) = 


E 


f eT l 
f'H 


X m 

Omxn 

0 1 



xf 

r 


v 


The constraint matrix itself, denoted as A/ G M( m + 1 ) x ( m + n ) > is 

Af = 


T 

- u m 

1 

O) 

fT 

—c J 

ol _ 


Using this notation, the LP ( |I2| ) can be restated as follows: 

rf 


minimize 

x f ,yf 


subject to 


tf 1 
A f x 


>rf(x f), 


( 22 ) 


> 0. 


Let the dual variables for the LP ([22]) be (A^ , /3^\ vJ ), where are the multipliers for the arc potential 
constraints, 13f the multiplier for the budgetary constraint and the multipliers for the non-negativity 
constraints. The KKT conditions for ([22]) are given by the following system. 


r-^ (x f) < Af 


_L 


V 

pf 


> o, 


(j)f - Af 1 


o < 

A f 

P f 


— yf 


_L yf > 0, 


= 0. 


(23) 


The KKT system ( |23[ ) can be rewritten in the following form: 

[A f 


jf = <j>f - Af 


Pf 


tf = —rf (x f) + Af 


> 0 


xf 

y f . 

>0, 

xf 

yf 

T 

y 

xf 

_y f _ 

>o, 

'A f 

p f _ 

IV 

o 

c*f» 

'-b 

l l 

1 _ 1 


= 0, 


= 0. 


(24) 
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In this form, it is easy to recognize that for a fixed value of x f , the KKT system is equivalent to the 
LCP (qf M-f) where 


qf (x ■') = 


<f>f 

r /( x -/) 


and Mf 


0(m+n) x (m+n) 

-Af T 

Af 

0(m+l)x(m+l) 


(25) 


The decision variable vector for the LCP is the vector of combined decision variables vjf := 
(xf, yf, \ f, @f ) 7 . Each agent’s KKT system ( [24] ) is parametrized by the collective decisions of other agents. 
Now by stacking all agents’ KKT systems together, the resulting model is itself an LCP, which can be seen 
from the following algebraic manipulation. 

First consider the following system obtained from (|24j) by expanding rf(x~ f). 


/ = 0/- Af 1 


tf = 


V 


xf 


p f . 

>o, 

_y f _ 

> o, 

— ? 


d° 

Af 

l 


1 m 

Omxn 

bf 

I -r± 

y f 

/'e.F 

o 

_1 

ol 


f’Af 


x^ 

yf 


>o, 


\f 

P f 


xf 

yf 


-i T 


V 


f = 0, 


>0, tf 1 


A f 

P f 


= 0. 


(26) 


Now introduce a matrix Mf to represent the interactions between agent /’s decision variables (xf , yf) and 
the KKT system of all the other agents, which has the following specific form: 


Omxm 

Omxn 

Omxm 

Omx 1 

Onxra 

Onxn 

Onxm 

Onx 1 

1m 

0 mxn 

Omxm 

Omx 1 

^lxm 

Olxn 

Olxm 

0 


(27) 


Using this notation, the stacked KKT system ( |26| > for agents / = l,.... F can be formulated as an 
LCP(q, M), with the vector q and matrix M given as follows: 


q = (q\q 2 ,...,q F ) T , where q f = (<j)f , d° ,b f ) T , 


(28) 


and 


M = 


M 1 

M 2 

M 3 


m f 

M 1 

M 2 

M 3 


m f 






M 1 

M 2 


M f ~ 1 

m f 


(29) 


Due to the equivalence between an agent /’s optimization problem ( [T2| l and its KKT system (24), the above 
LCP(g, M) is equivalent to the corresponding (continuous) DSPI game in the sense that a candidate point 
(x 1 )X 2 ; • • ■ iX F )> where x^ = ( x ^, is an equilibrium to the DSPI game if and only if there exist 
associated Lagrangian multipliers such that they together solve the LCP(q, M). 

Methods for solving LCPs fall broadly into two categories: (i) pivotal methods such as Lemke’s algo¬ 
rithm, and (ii) iterative methods such as splitting schemes and interior point methods. The former class of 
methods are finite when applicable, while the latter class converge to solutions in the limit. In general, the 
applicability of these algorithms depends on the structural properties of the matrix M. In the following 


analysis, we show that LCP(q, M) for the DSPI game, as defined in ( |28| ) and d29| ), possesses two prop¬ 
erties that allow us to use Le mk e’s pivotal algorithm: (i) the matrix M is a copositive matrix, and (ii) 
qe (SOL(0,M))*fl 

2 Given a set K £ R d , the set K* denotes the dual cone of K; i.e. K* = {y £ l d : y T x >0, Vat £ A'}. 
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We first show that M is copositive. Recall that a matrix M E is said to be copositive if x T Mx > 0 
for all x E Ml. 


Lemma 1. M defined as in ( |29| ) is copositive. 

Proof. Let w E M^_ m+Tl+1 . Using the block structure of M given in ( [29] ), w T Mw can be decomposed as 
follows. 


F F F 

(T , 


W 


T ^— r f! ff 
W J . 


Mw = wf w f 

/=i /=i /'=l 

f'M 


(30) 


We analyze the terms in the two summations separately. First consider w ^ T for any agent /. Let the 
dual variables (A^, fi *) be collectively denoted by 5?. 


w 


f M*w f = 


X fT 5 fT 


0 

-Af T ' 

A* 

0 


X J 

6* 


= - x fT Af T 5 f + 5 fT Af x f = 0. 


Now consider any term of the form af V M? w-f': 


w 


f T Mf'wf = 


V f T yf T X f T 


f T yf T A f T ff 


0 

0 

0 

0 ' 

0 

0 

0 

0 

2 m 

0 

0 

0 

0 

0 

0 

0 _ 


. ft. 

x J 

yf 

\f 


0 

0 

f 

T.J 


= A f T xf. 


Combining (fTi~[) and (321 we obtain 


F F 


W 


fT /' 
X 1 . 


Mw = ^ 

/=! /'=1 

/¥/ 

Since \f’s and x^’s are the elements of w, w > 0 clearly implies that w T Mw > 0. 


(31) 


(32) 


(33) 


m 


We now show property (ii) of the LCP(c/, M); that is, q E (SOL(0, M))*. 

Lemma 2. Let the vector q and the matrix M be as defined in ( |28[ ) and ( |29[ ) respectively. Then q E 
(SOL(0,M))*. 


Proof. First note that SOL(0, M) f 0 for any M, since 0 is always a solution to LCP(0, M). Now consider 
a w E SOL(0,M); i.e. 0 < w _L 0 + Mw > 0. We prove that q T w > 0. Observe that q T w can be 
decomposed as follows: 


T 

q w = 


F f 

qf wf = 

/=! 

F 


£(V T HI +d ° T x f+bfpf) 

f =i ^ ^ J J 


(34) 


15 































The last two terms in the last equality above, d° T \f and b? /3 7 , can be easily seen to be nonnegative for 
f = 1,... F. This is so because w G SOL(0, M) implies that A 7 , (3? > 0, and by assumption d°, b? > 0 
for each / = 1,..., F. 

Now we focus on the first term in the last equality of ((34]): (yf/ — 1f[s) ■ First since Mw > 0, nf 

must solve the system (26i for / = 1.....F, with cf>f, d° and b* all set at zeros (as <j>f, d° and b? are the 
components of the vector q in the LCP(q, M), as defined in ( [28] ); and in LCP(0, M), q = 0). In this case, 
considering the primal feasibility of wf we obtain the following: 


c a x a ^ 0 


a£A 

F 


yl~Vv + ^2 x i,v > 0 V(ii,r)ei 

/=! 


for/ = l,...,F. 


(35) 


Recall that cl > 0 for all a G A and / = 1,..., F by assumption. Therefore, ( [33] ) implies that = 0 for 

any agent /. It is easy to see that in this case, we must have 


y[-yl> 0 V(u, v) G A, for f = l,...F. 


(36) 


Now consider any s 7 -f 7 path V*. By assumption, there must be at least one such path for each agent /. By 
summing up the inequalities ( [36] ) over the arcs in the path Vf we obtain the desired result. In other words, 

y f u~yl = y f s f - y{s > °- ( 37 ) 

(u,v)evf 


Summing up over the agents, we thus have shown that q 1 w > 0 for any w G SOL(0, M), which completes 
the proof. □ 

With Lemma [T] and [2] we can apply the following result from Cottle et al. fl2| . 

Theorem 1. (112], Theorem 4.4.13) If M is copositive and q G (SOLiq. M))*, then Lemke’s method will 
always compute a solution, if the problem is nondegenerate |^] 

As discussed earlier, the LCP approach is not applicable for discrete DSPI games due to the lack of 
necessary and sufficient optimality conditions. In the following we develop a decentralized approach that 
works for both discrete and continuous DSPI games. 

4.3 Gauss-Seidel Algorithm (Algorithm 1) 

We first present the basic form of a best response based algorithm. The idea is simple: starting with a 
particular feasible decision vector xo = (Xw A'o • • • •, Xo) C fl(xo), solve the optimization problem of a 
particular agent, say, agent 1, with all of the other agents’ actions fixed. Assume an optimal solution exists 
to this optimization problem, and denote it as x\* ■ The next agent, say, agent 2, solves its own optimization 
problem, with the other agents’ actions fixed as well, but with xj replaced by x [ *. Such an approach is often 
referred to as a diagonalization scheme or a Gauss-Seidel iteration, and for the remainder of this paper we 
use the latter name to refer to this simple best-response approach. 

Consider applying the Gauss-Seidel iteration to a GNEP, with each agent solving the optimization prob¬ 
lem ( fT3| ). denoted as Vixf )- The Gauss-Seidel iterative procedure is presented in Algorithm 1 below. 

Note that updates in agent /’s decisions occur at iteration k only if there is a strict increase in the agent’s 
payoff at the iteration. The algorithm can be directly applied to compute an equilibrium of a DSPI game with 

3 A detailed discussion of degeneracy and cycling in Lemke’s method can be found in Section 4.9 of | lij. 
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Algorithm 1 Gauss-Seidel Algorithm for a GNEP 

Initialize. Choose X o = (Xo> • • •, Xo ) with Xo G s/ (Xo f ) V/ G T . Set A: G- 0. 
Step 1: 

for / = 1,2,..., F do 

Set xlj <- (x3fc+i,• • ■ > xl+lx f k +1 ,■■■,Xk ); 

Solve ViXkf) to obtain an optimal solution Xlj, 

^ (xtj’Xkj) > 0 f (x f k , Xkj) then Set X f k+1 X%,f’ 
else Set x [ +1 <- x{; 

end if 
end for 

Set Xfc+i <- (xjfc+t) ■ ■ ■ 5 XjT+i)- 
Set k <— k + 1. 

if X k satisfies termination criteria, then STOP; else GOTO Step 1. 


discrete interdiction. For finite termination, we fix a tolerance parameter e and use the following stopping 
criterion: 

llXfc - Xfc-ill < e- (38) 

Proposition 4. Suppose that the Gauss-Seidel algorithm (Algorithm [7]) is applied to the DSPI game with 
discrete interdiction, and the termination criterion ( ]38| ) is used with e < 1 .If the algorithm terminates at 
Xk, then Xk > s an equilibrium to this problem. 

Proof. Since the variables Xk are integral for discrete interdiction problems, choosing e < 1 for the ter¬ 
mination criterion will ensure that the algorithm terminates only when successive outer iterates are equal. 
Consequently, by the assumption, Xk-i = Xk at termination. This also implies that Xk-i f = Xk ^ or 
/ = 1,..., F. By construction of Xk, we must then have 

Xk= argmin 6» / (x / ,Xfe / )- 

Clearly, X k must then be an equilibrium. P 


Even though there is no guarantee that the algorithm will in fact converge, we note that in the discrete 
case, it is possible to detect when the algorithm fails to converge. Recall that C /P for each agent 

/ G T, where /P is defined below. 


K f 


< ( x f ,y f ) 


r c /<(,/ 

/ J ^UV^UV — 1 

(u,v)ev > . 

0 < yl < M \/u G V, 


(39) 


Clearly, the set fl/Li ^ i s finite. Any intermediate point X k generated by Algorithm jlj must certainly 
satisfy the budgetary constraints on xj. and the bound constraints on y[ for each agent /. Therefore X k G 
I\f In other words, the set of possible points X k generated by Algorithm |lj lies in a finite set. This 
means that if the algorithm fails to converge, it must generate a sequence that contains at least one cycle. 
The existence of such cycles in non-convergent iterate paths can then be used to detect situations in which 
the algorithm fails to converge. 

Proposition[4]is likely the best one can do for general DSPI games under discrete interdiction. However, 
for the subclass of such games with common source-target pairs, we can in fact prove that the best response 
dynamics always terminates in a NE in a finite number of steps. 
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Proposition 5. Consider a DSPI game with discrete interdiction with common source-target pairs, and 
assume that the initial arc lengths d and arc extensions e are integral. Suppose dial A Igorilhm [7] is applied 


to such a problem, and the termination criteria (381 is used with e < 1. Then the algorithm will terminate 
finitely at an equilibrium. 

Proof. Denote the common source node as s, and the common target node as t. The set of joint feasible 
strategies in x under the given assumptions is a finite set. Moreover, all the agents attempt to minimize the 
common objective, namely the s-t path length. Note that at any iteration k at which an update occurs for any 
agent’s decision, there must then be a strict increase in the s-t path length. Thus there can be no cycles in the 
sequence {xfc}- Furthermore, since the set of joint feasible strategies is finite, the sequence must terminate 
at some point \*. It is easy to show that x* must be an equilibrium (cf. Proposition |4|». □ 


4.4 Regularized Gauss-Seidel Algorithm (Algorithm 2) 

One disadvantage of the “naive” Gauss-Seidel algorithm described above is that for continuous GNEPs, it 


can fail to converge to equilibria. However, Facchinei et al. [19] showed that under certain assumptions, 
we can overcome this issue by adding a regularization term to the individual agent’s problem solved in a 
Gauss-Seidel iteration. 

The regularized version of the optimization problem for agent / £ T is 


maximize 

x s 

subject to 


o f (x f ,x f )~ T 
X f 6 E f (x~ f ), 


X S ~ X S 


(40) 


where r is a positive constant. Here the regularization term is evaluated in relation to a candidate point 
X . Note that the point x? and the other agents’ decision variables x ^ are fixed when the problem (|40l> i 


is 


solved. We refer to problem ( |40| ) as 72(x The regularized Gauss-Seidel procedure, herein referred 

to as Algorithm 2, is simply Algorithm 1, except that 7 Z(x~^,X*) is solved in each iteration k instead of 
P(Xk f f ), with Xf given by xf, for each /. 


This version of the algorithm, along with its convergence proof, was originally presented in [19] to solve 
GNEPs with shared constraints. The difficulty here that prevents us from showing convergence lies in the 
fact that we are dealing with GNEPs with non-shared constraints. As a result, any intermediate points re¬ 
sulting from an agent’s best responses need not to be feasible in the other agents’ problems. Consequently, 
we use Algorithm 2 only as a heuristic algorithm to solve DSPI games under continuous interdiction. Nev¬ 
ertheless, we can show that if Algorithm 2 converges, then the resulting point is an equilibrium to the DSPI 
game. 

Proposition 6. Let {Xk} l ,e the sequence generated by applying Algorithm 2 to the DSPI problem under 
continuous interdiction, wherein each agent solves Suppose { Xk } converges to Then x ls cm 

equilibrium to the DSPI problem. 


The proof of this proposition is almost identical to that of Theorem 4.3 in Facchinei et al. 119]. However, 


we do want to point out one key difference in the proof. In Proposition [6j we need to assume that the 
entire sequence {xfc} converges to % This is a strong assumption in the sense that it also requires that 
all the intermediate points Xk,f i n Algorithm 2 to converge to x, a fact key to proving that x is indeed an 
equilibrium. In contrast, for GNEPs with shared constraints, this assumption may be weakened because 
the intermediate points Xk,f ar| d therefore the cluster points of the sequence generated by the algorithm are 
guaranteed to be feasible. The complete proof is presented in Appendix |A| 

Similar to discrete DSPI games, the convergence of Algorithm 2 can be guaranteed for continuous- 
interdiction DSPI games with common source-target pairs. The key fact that allows us to prove this stronger 
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result is that by dropping the dependence of the variables y on the agents / E F, any unilateral deviation in 
the shared variables y results in a solution that remains feasible in the other agents’ optimization problems. 
The convergence result is formally stated below. 

Proposition 7. Consider applying Algorithm 2 to the DSPI problem under continuous interdiction with 
common source-target pairs, where each agent solves ©• Let { Xk } be the sequence generated by the 
algorithm. If x is a cluster point of this sequence, then it also solves the DSPI problem. □ 


4.5 Numerical Results 


We use the algorithms presented in the previous section to study several instances of DSPI games. The 
decentralized algorithms were implemented in MATLAB R2010a with CPLEX vl2.2 as the optimization 
solver. The LCP formulation for the DSPI game with continuous interdiction was solved using the MATLAB 
interface for the complementarity solver PATH [20]. Computational experiments were carried out on a 


desktop workstation with a quad-core Intel Core i7 processor and 16 GHz of memory running Windows 7. 

In the implementation of the decentralized algorithm, for DSPI games with discrete interdiction, we used 
Algorithm [I] with r = 0. For DSPI games with continuous interdiction, we followed a strategy of trying 
the “naive” Gauss-Seidel algorithm - i.e. Algorithm [T] with r = 0 - first. If this version failed to converge 
in 1000 outer iterations, we then set r to a strictly positive value and used the more expensive regularized 
Gauss-Seidel algorithm. 


Computing Equilibria 

First, we applied the algorithm to Example [2] in Section |3.1.1[ which is a DSPI game with continuous 
interdiction. In particular, the network is given in Figure [2] and there are 2 agents: agent 1 has an adversary 
with source node 1 and target node 5, and agent 2 has an adversary with source node 1 and target node 6. 
Both agents have an interdiction budget of 1. The initial arc lengths are 0, and the interdiction costs are equal 
for both agents and are given as the arc labels in Figure [2j with e = 2. We set the regularization parameter 
t = 0.01. We were able to obtain a solution within an accuracy of 1CT 6 in 3 outer iterations. 

Furthermore, we obtained multiple Nash equilibria by varying the starting point of the algorithm. All the 
equilibria obtained resulted in the same shortest path lengths for each agent. Some of the equilibria obtained 
are given in Table [3] The column xo represents the starting interdiction vector for each agent, the columns 
x l N and x'fr give the equilibrium interdiction vectors for agents 1 and 2, respectively. The seven components 
in the vectors of xq, xjy and x 2 N represent the interdiction actions at each of the seven arcs in Figure |5j with 
the arcs being ordered as follows: first, the top horizontal arcs (1, 2) and (2, 3), then the vertical arcs (1,4), 
(2,5) and (3,6), and finally the bottom horizontal arcs (4,5) and (5,6). The remaining two columns in 
Table [3], p\ and p2, give the shortest path lengths for agents 1 and 2 respectively, at the equilibrium xn- 


Table 3: Multiple equilibria for the instance of the DSPI game in Example 2 

””zo Xn Xn Pi P2 


( 0 , 0 , 0 , 0 , 0 , 0 , 0 ) 
( 0 . 2 , 0 . 2 , 0 , 0 , 0 , 0 , 0 ) 
( 0 , 0 , 0 , 0 , 0 , 0 . 2 , 0 . 2 ) 
(0,0, 0,0, 0,0.3, 0.3) 
(0.3, 0.3, 0,0, 0,0,0) 
(0.25,0.25,0,0,0,0,0) 
(0,0,0,0,0,0.25,0.25) 
(0,0,0,0,0,0.15,0.15) 
(0.15,0.15,0,0,0,0,0) 


(0,0, 0.5,0.5, 0,0,0) 
(0,0, 0.6,0.4, 0,0,0) 
(0,0, 0.4,0.6, 0,0,0) 
(0,0,0.35,0.65,0,0,0) 
(0,0,0.65,0.35,0,0,0) 
(0,0,0.625,0.375,0,0,0) 
(0,0,0.375,0.625,0,0,0) 
(0,0,0.425,0.575,0,0,0) 
(0,0,0.575,0.425,0,0,0) 


(0, 0, 0.1667, 0.1667, 0.6667, 0) 0.6667 

(0, 0, 0.0667,0.2667,0.6667, 0) 0.6667 

(0,0, 0.2667,0.0667,0.6667, 0) 0.6667 

(0, 0, 0.3167,0.0167,0.6667, 0) 0.6667 

(0, 0, 0.0167,0.3167,0.6667, 0) 0.6667 

(0, 0, 0.0417,0.2917,0.6667, 0) 0.6667 

(0, 0, 0.2917,0.0417,0.6667, 0) 0.6667 

(0,0, 0.2417,0.0917,0.6667, 0) 0.6667 

(0, 0, 0.0917,0.2417,0.6667, 0) 0.6667 


0.6667 

0.6667 

0.6667 

0.6667 

0.6667 

0.6667 

0.6667 

0.6667 

0.6667 
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Figure 3: Network structure for DSPI Example [4] 


Example 4. To test the algorithm on larger-scale problems, we expanded the instance in Example[2]to larger 
network sizes and numbers of agents. For F agents, the graph contains 2 (F + 1) vertices with the edges as 
shown in Figure [3] The source vertex for all agents is a\. The target vertex for a given agent / is 6 /+i- The 
initial arc lengths are all assumed to be zero. The interdiction costs are the same for all the agents and are 
given as the arc labels in Figure [3] All the agents have an interdiction budget of 1. The cost parameter e is 
chosen as 2. For discrete interdiction on these graphs, the arc extensions are assumed to be length 1. 

The running time and iterations required to compute equilibria for these instances are summarized in 
Table [4j The first four columns in the table give the number of outer iterations and runtime for Algorithm [T] 
over these instances with continuous interdiction. The results indicate that the running time for the central¬ 
ized Femke’s method increases monotonically with the problem size. Flowever, the running time for the 
decentralized method depends not just on the problem size but also on the number of outer iterations. In 
general, there is no correlation between these two parameters. Indeed the algorithm is observed to converge 
in relatively few iterations even for some large problem instances. This is in stark contrast to the rapid 
increase in running time observed for the FCP approach as problem size increases. 

It must be noted that the order in which the individual agent problems are solved in the Gauss-Seidel 
algorithm plays an important role. Indeed it was found that the algorithm failed to converge for certain 
orderings of the agents, but succeeded in finding equilibria quickly for the same instance with other order¬ 
ings. For instance, for a network of size 25, solving the agent problems in their natural order {1,2,..., 25} 
resulted in the failure of the “naive” version of the algorithm to converge even after 1000 outer iterations. 
However, with a randomized agent order, the algorithm converged in as few as 13 iterations. It is encour¬ 
aging to note that for the same agent order that resulted in the failure of the naive version, the regularized 
method converged to a GNE within 394 outer-iterations with a runtime of 28 wall-clock seconds. 

Computation of Efficiency Losses 

Using the decentralized algorithm and its potential to find multiple equilibria by starting at different points, 
we empirically study the efficiency loss of decentralized interdiction strategies in DSPI games. We focus 
first on Example [4j with the underlying network represented in Figure [3] Before computing the average 
efficiency losses, we first establish a theoretical bound on the worst-case price of anarchy, for the purpose 
of comparison. 

We start with the specific instance as depicted in Figure [3] Recall that there are F agents and the source- 
target pair for agent / is (ai, bf + 1 ). Note that all paths for all agents contain either the arc (ai, a 2 ) or the arc 
(ai, bi). Then one feasible solution to the centralized problem is for each agent to interdict both these arcs 
by 1/(2 + e) for a total cost of 1. In this case the length of both arcs become F /(2 + e), giving a shortest 
path length of F /(2 + e) for each agent. Note that this is not an equilibrium solution as agent 1 can deviate 
unilaterally to interdict arcs (ai, b\) and ( 02 , 62 ) by 1/2 to obtain a shortest path length of (i 71 + e/2)/(2 + e). 

A Nash equilibrium to this problem is given by the following solution. Agent / interdicts the vertical 
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Table 4: Number of iterations and running times for DSPI Example 3. 



Continuous Interdiction 

Discrete Interdiction 


Decentralized 

LCP 

Decentralized 

# Agents 

# Iters Runtime (s) 

Runtime (s) 

# Iters 

Runtime (s) 

5 

3 

0.0205 

0.0290 

5 

0.1776 

10 

5 

0.0290 

0.1833 

3 

0.1627 

15 

11 

0.1103 

0.7534 

3 

0.2419 

20 

5 

0.0723 

2.1106 

3 

0.3164 

25 

13 

0.2609 

4.8167 

3 

0.4005 

30 

15 

0.4070 

10.2256 

3 

0.5155 

35 

10 

0.3605 

17.7387 

3 

0.5948 

40 

41 

1.7485 

30.2382 

3 

0.7387 

45 

12 

0.6601 

48.6280 

3 

0.8794 

50 

12 

0.7981 

75.0420 

3 

1.0385 


arcs (ai, bi ),..., (a/, bf) by 1 /(/(/ + 1)) and the arc (oy + i, &/+i) by //(/ + 1). Each agent then has 
a shortest path length of F/(F + 1). Note that all the s*-t^ paths are of equal length for every agent. 
Therefore diverting any of the budget to any vertical arcs will result in unequal path lengths and a shorter 
shortest path for any agent. Obviously, diverting the budget to interdict any of the horizontal arcs is cost 
inefficient because of their higher interdiction cost at 1 + e. Thus no agent has an incentive to deviate from 
this solution. 

We now have a feasible solution to the centralized problem that has an objective value of F/ (2 + e) for 
each agent, and a Nash equilibrium that has an objective value of F/(F + 1) for each agent. Therefore, by 
its definition in ([4}, the worst-case price of anarchy for the DSPI game depicted in Figure [3]must be at least 

(F + l)/(2 + e). 

Using the regularized Gauss-Seidel algorithm we also compute lower bounds on the worst-case price of 
anarchy and average efficiency losses for the same network topology with varying number of agents. The 
instances we consider arc obtained by varying e uniformly in the range of (1.5,10). For the purpose of 
comparison, the numerical results are plotted in Figure [4] below. Note that the average-case efficiency loss 
is much lower than the worst-case price of anarchy. For the particular graph structure under consideration, 
we observe that the average efficiency loss grows at a much lower rate than the worst-case efficiency loss. 
However this observation cannot be generalized to other graph structures and such patterns may only be 
discernible by applying a decentralized computational framework as the one we presented. 

Example 5. We further tested the decentralized algorithms for continuous interdiction on random graphs 
to study average efficiency losses of equilibria of DSPI games on networks with different topologies. For 
the random graphs we generated, the input parameters include the number of vertices and the density of 
a graph, which is the number of arcs divided by the maximum possible number of arcs. The number of 
agents was chosen randomly from the interval (0, |U|/2), and one such number is chosen per vertex set 
size. Source-target pairs were chosen at random for each interdictor. Fixing the vertex set, we populated the 
arc set by successively generating source-target paths for the agents until the desired density was reached. 
We thus ensured connectivity between the source-target pairs for each agent. Costs, initial arc lengths 
and interdiction budgets were chosen from continuous uniform distributions. Arc interdiction costs were 
assigned uniformly in the range [1,5]. The budget for each agent / was chosen uniformly from the interval 
[6^/10, ///2], where f/ = J2 a eA c " • The initial length of each arc was chosen uniformly from [1, 5]. 

For each combination of vertex set size, the number of agents, and graph density, we generated 25 ran¬ 
dom instances by drawing values from the uniform distributions described above for the various network 
parameters. For each instance, we used 10 different random permutations of the agents to run the decentral- 
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Figure 4: Efficiency loss with respect to the number of agents. 


ized algorithms in an attempt to compute multiple equilibria. The lower bound on the price of anarchy for 
the game was computed as the worst case efficiency loss over these 25 instances. The average efficiency loss 
over these instances was also computed. The results are summarized in Table [5] Our experiments indicate 
that the average efficiency loss and the worst-case price of anarchy tend to grow as the number of vertices 
and number of agents increases; on the other hand, these measures of efficiency loss sometimes do not 
appear to be monotonically increasing or decreasing with respect to the density of the underlying network. 


Table 5: DSPI Continuous Interdiction - Random Graphs 


# Vertices 

# Agents 

Density 

Avg. Run Time (s) 

# Avg Iters. 

a.e.l 

p.o.a 

5 

3 

0.25 

0.0037 

3 

1.3133 

1.5561 

5 

3 

0.5 

0.0038 

3 

1.3265 

1.9529 

5 

3 

0.75 

0.0040 

3 

1.5099 

2.3829 

10 

3 

0.25 

0.0065 

4 

1.5366 

2.2078 

10 

3 

0.5 

0.0176 

11 

1.4538 

2.3114 

10 

3 

0.75 

0.0132 

8 

1.4273 

2.1342 

15 

4 

0.25 

0.0263 

11 

1.7091 

2.9246 

15 

4 

0.5 

0.0939 

33 

1.7524 

2.7904 

15 

4 

0.75 

0.1267 

42 

1.5695 

2.1425 

20 

5 

0.25 

0.1269 

34 

2.1907 

3.2885 

20 

5 

0.5 

0.2087 

43 

1.8523 

2.7906 

20 

5 

0.75 

0.5416 

100 

1.7967 

2.3782 

25 

7 

0.25 

0.7167 

105 

2.5631 

4.8788 

25 

7 

0.5 

1.9564 

207 

2.3022 

5.5794 

25 

7 

0.75 

1.8476 

158 

1.9884 

2.4423 


5 Conclusions and Future Work 


In this work, we introduced decentralized network interdiction (DNI) games and gave formulations for one 
such class of games - decentralized shortest path interdiction (DSPI) games. We analyzed the theoretical 
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properties of DSPI games: in particular, we gave conditions for the existence of equilibria and examples 
where multiple equilibria exist. Specifically, we proved the existence of equilibria for general DSPI games 
under continuous interdiction. On the other hand, for the discrete counterpart, we provide a counterexample 
for existence. However, for the subclass of problems with common source-target pairs, we are able to 
provide an existence guarantee. 

We also showed that the DSPI game under continuous interdiction is equivalent to a linear comple¬ 
mentarity problem, which can be solved by Lemke’s algorithm. This constitutes a convergent centralized 
method to solve such problems. We also presented decentralized heuristic algorithms to solve DSPI games 
under both continuous and discrete interdiction. Finally, we used these algorithms to empirically evaluate 
the worst case and average efficiency loss of DSPI games. 

There are other classes of network interdiction games that can be studied using the same framework 
we have developed, where the agents’ obstruction functions are related to the maximum flow or minimum 
cost flow in the network. Establishing theoretical results and studying the applicability of the decentralized 
algorithms to other classes of decentralized network interdiction games are natural and interesting extensions 
of this work. 

In our study of DSPI games, we also made the assumption that the games have complete information 
structure; that is, the normal form of the game - the set of agents, agents’ feasible action spaces, and their 
objective functions - is assumed to be common knowledge to all agents. In addition, we made the implicit 
assumption that all input data are deterministic. However, data uncertainty and lack of observability of other 
agents’ preferences or actions are prevalent in real-world situations. For such settings, we need to extend 
our work to accommodate games with exogenous uncertainties and incomplete information. 

One might also be interested in designing interventions to reduce the loss of efficiency resulting from 
decentralized control. This leads to the topic of mechanism design. Such a line of work also defines a very 
important and interesting future research direction. 
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A Proof of Proposition [6] 


Since Xk —■► X we must have x{ and 


lim 

k—> oo 


x { + 1 



By construction of Xk,f> (FT]) implies that 


(41) 


lim Xk,f = X ■ (42) 

k — yoo 

By Step 1 of Algorithm 2, xl+i e Xkff )• Since x{+i X f > Xk/ X~ f > and -/Xkj) is defined 
by lineal - inequalities parametrized by x~k /> I s straightforward to see by continuity arguments that ;\4 e 
(xT *)• I n other words, % I s feasible for every agent’s optimization problem ( [15] ). 

We claim that for each agent / e T 

Q f (x f ,x~ f ) > 6 f (x f ,x~ f ), V x f e E f (x~ f ). 
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For the purposes of establishing a contradiction, let there be an agent / and a vector E E7(\ 4) such 
that 

^(x / ,X" / )<^ / (e / , re¬ 
using the linearity of the functions that define the set valued mapping H* (■) we can show its inner semicon¬ 
tinuity relative to its domain (cf. [34] Chapter 5). Because x^ £ dom(E/(-)), we then have 


lim inf 3(£ f ) 2 H(x / "), 


where the limit in (|43l> is given by the following: 


lim inf 


7 - (f /_ ) = {V I /_ -> X /_ , 3ti{ -> it with u[ E H^(x fc /_ )} 


(43) 


(44) 


Since £4 E (x 4), equations ( j42] ), ( |43| ) and ( p44] ) allow us to construct a sequence E E/ (x k y) such 
that ef -A F as fc —> oo. 

Let d-’ = (F — Xj- Then by the subdifferentiality inequality for concave functions we must have 

« ,/ (x / ,x“ / ;d / )> 0. (45) 

Denote by <I>4 the regularized objective function for agent /’s subproblem. In other words, 


® f (x f ,X f ,z) = 6 f (x f ,X f )~T 


X f ~z 


We then have 


^(X^X f ,z-,d f ) = 6' f (x f ,X f ',d f ) -2r(x f - z) T d f . 


Note that xi+i ' s obtained by solving the problem 7 Z(x k pxl)- hi other words, xt + \ maximizes 
Xi.fiXk) over the set 34(% L). Since this is a concave maximization problem, we then apply 

"'j/ "'j/ 

first order optimality conditions to obtain the following. 


x fc) /, x[; (Cjfc - x[+i)) = ^’(Xfc+1, x fc) /, x[; (£/~ - x{+i)) 

+ 2r(x£ +1 - Xk f )(d - Xk+i) 


(46) 


< 0. 


Passing to the limit k —> oo, k E K and using ( [42] ) we obtain 

o > 6>' / "(x / ,x“ / ; - x / )) 


which contradicts (45 1 . 
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